Using surface-enhanced Raman spectroscopy to probe artificial dye degradation on hair buried in multiple soils for up to eight weeks

The discovery of clandestine burials poses unique challenges for forensic specialists, requiring diverse expertise to analyze remains in various states. Bones, teeth, and hair often endure the test of time, with hair particularly exposed to the external environment. While existing studies focus on the degradation of virgin hair influenced by soil pH and decomposition fluids, the interaction between artificial dyes on hair and soil remains underexplored. This paper introduces a novel approach to forensic hair analysis that is based on high-throughput, nondestructive, and non-invasive surface-enhanced Raman spectroscopy (SERS) and machine learning. Using this approach, we investigated the reliability of the detection and identification of artificial dyes on hair buried in three distinct soil types for up to eight weeks. Our results demonstrated that SERS enabled the correct prediction of 97.9% of spectra for five out of the eight dyes used within the 8 weeks of exposure. We also investigated the extent to which SERS and machine learning can be used to predict the number of weeks since burial, as this information may provide valuable insights into post-mortem intervals. We found that SERS enabled highly accurate exposure intervals to soils for specific dyes. The study underscores the high achievability of SERS in extrapolating colorant information from dyed hairs buried in diverse soils, with the suggestion that further model refinement could enhance its reliability in forensic applications.

signal (inelastic scattering of light) by utilizing nanostructured surfaces, providing highly sensitive molecular fingerprinting for substances.In the context of hair, Kurouski and Van Duyne demonstrated how using gold nanoparticles could lead to the strong resonance of colorants found in dyes on dyed hair 10 .Since then, SERS has been used to detect and differentiate over 30 different dyes on hair, as well as probe the degradative effects and still detect the colorant on hair subjected to high heat and weeks of sunlight and lake water exposure [11][12][13][14] .
In this study, we employ SERS coupled with partial least squares discriminant analysis (PLS-DA) to assess the efficacy of detecting artificial dyes on hair buried in three distinct soil types for up to eight weeks.Our evaluation also extends to predicting the correct number of weeks since burial based on SERS spectra.This valuable information not only solidifies the role of SERS in forensic hair analysis but may also hold the potential to provide insights into post-mortem intervals.

Hair preparation
The hair used in this research remained unchanged, consisting of natural, untreated hair from a voluntarily donating 21-year-old Caucasian female.She was thoroughly briefed on the intended use of her hair.The hair was dyed using Ion brand hair dye either of Ion Jet Black (permanent black-"PBA"), Ion Sapphire (permanent blue-"PBU"), Ion Radiant Orchid (permanent purple-"PPU"), Ion Garnet (permanent red-"PRD), Ion Blackest Black (semi-permanent black-"SBA"), Ion Sapphire (semi-permanent blue-"SBU"), Ion Radiant Orchid (semi-permanent purple-"SPU"), or Ion Garnet (semi-permanent red-"SRD").The information for each hair dye and colorants can be found in the SI, Tables S1 and S2.A clean beaker was used to mix permanent hair dye and activator and a clean graduated cylinder was used to pour equal portions of each hair colorant onto each batch of hair.Permanent dyes were mixed with an Ion Sensitive Scalp Creme Developer in volume ratios consistent with the manufacturer's instruction label.The colorant was then gently rubbed in until all hair strands in each batch were completely coated.After the elapsed time indicated by the instruction label on the box passed, the hair was rinsed off under low pressure deionized water within a small stainless-steel strainer until the water running off was clear, after which the hair was left to air dry.All experiments were performed in accordance with NIH guidelines and regulations.
Previously reported study by our group showed that SERS could be used to distinguish hair from different races/ethnicities, age groups, and sexes 15 .Therefore, the reported results should be valid only for young female individuals of Caucasian race.In following studies, we expect to expand the reported results to other races/ ethnicities, age groups, and genders.However, the results of these studies are the subjects for separate studies.

Experimental treatment
Cactus, Palm & Citrate Potting Mix (The Scotts Miracle-Gro Company, Oregon, US) (henceforth, Soil Type A or "STA"), All Purpose Garden Soil (The Scotts Miracle-Gro Company, Oregon, US) (henceforth, Soil Type B or "STB"), and clay soil (collected from Bryan, Texas) (henceforth, Soil Type C or "STC") were all used as soil types for this experiment.The pH of each soil type was measured using a benchtop pH meter to be 6.38, 6.81, and 9.96, for soil types A, B, and C, respectively.In our previous study, we showed that substances with acidic pH, such as white wine and orange juice, drastically lower the accuracy of SERS-based identification of colorants on hair 16 .We hypothesized that the acidic pH facilitates the desorption of artificial colorants from the hair surface.Therefore, we reported pHs of the soils used in our study.Based on the reported pHs, we can conclude that none of the analyzed soils were powerfully acidic.As a result, no acid-driven degradation of colorants should be expected.
Soil was placed outside within ten-gallon pots, filled to an inch below the top, to endure realistic weather conditions within the College Station-Bryan area.The dyed hair was placed completely under the soil (but no more than 5 cm) so that none of the dyed hair was exposed directly to sunlight or other weather conditions.

Sample collection
Samples were buried in respective soils for a full week before each collection.During collection, hair groups were removed all-at-once from their soil to snip an inch from a few hair strands that were then sealed in a plastic bag and stored in a dark environment to be later used for scanning/analysis.Removed hair groups were then re-buried in the same position they were removed.The collection of samples stopped after eight weeks of exposure to the respective soils.

Raman spectroscopy
The gold nanoparticle (AuNP) solution, excitation wavelength of laser light, equipment, and power were chosen based on published methods from Esparza and coworkers 17 .The AuNPs were characterized using JEOL JSM-7500F Scanning Electron Microscopy (SEM) and SERS, Figs.S1, S2.SERS spectra were collected using a TE-2000U Nikon inverted confocal microscope, equipped with a 20 × objective.A solid-state laser generated 785 nm light, while power through each sample was kept at 1.8 mW.Scattered light was collected using the same magnification and directed using a 50/50 beam splitter into an IsoPlane-320 spectrometer (Princeton Instruments) equipped with a 600 groove/mm grating.Prior to entering the spectrometer, elastically scattered photons were blocked by a long-pass filter (Semrock, LP03-785RS-25).Inelastically scattered photons were collected using PIX-400BR CCD (Princeton Instruments).
Fifty spectra from each sample, comprising 15-20 spectra from each of three regions on one hair strand per sample group, were collected by placing each hair on a glass cover slide and applying ~ 5 µL of the AuNP solution described above.Hairs within groups that consistently returned spectra with full AuNP signature contribution were only scanned five times for the relevant sampling period.[Note: The absence of colorant molecules for proper spectral acquisition of these colorants can be attributed to a possible larger predisposition to degradation www.nature.com/scientificreports/within certain soils This susceptibility is likely influenced by the exclusive reliance on a single primary colorant compound, 1-Hydroxyethyl-4,5-Diamino Pyrazole, in both pertinent dyes (PPU and PRD).Additionally, while PBA shares this compound, it incorporates another colorant, Toluene-2,5-Diamine.]The strand of hair was coated by the 5 µL drop AuNP solution by moving the hair around the slide until the nanorod solution outlined ~ 10 mm in length (incidental of whether the strand was longer or shorter than 10 mm) of the strand of hair.The laser light was positioned on the hair medial-proximally, as had the most consistently intense peaks for bands of interest.Overall acquisition times ranged from 18 to 30 s.

Reference dye preparation
In order to extrapolate the (expected) source of the primary molecules responsible for resonances seen within spectra, the respective hair dyes for each group were analyzed without reaction to hair.To do so, 50 mg of each semi-permanent dye was placed on a glass coverslip and mixed with 5 uL of AuNPs.For permanent hair dyes, 50 mg of each dye was mixed 500 uL of developer, mixed by hand until homogenous and left to oxidize overnight.Once oxidized (i.e., the solution appeared to be its expected color), 50 uL of each activated dye was placed on a glass coverslip and 5 uL of AuNPs were added.All mixtures were analyzed with the same instrument above at 12-20 s acquisition times using 8 mW (785 nm) laser power.

Data analysis
All spectra were trimmed from 308 cm −1 to 1708 cm −1 (for noise reduction in analyses), smoothed (2nd order) using a Savitzky-Golay filter, baseline-corrected (2nd order) using automatic weighted least squares, and areanormalized (all data points of all spectra were normalized to their respective bands) before analysis using MAT-LAB (as displayed).Chemometric analysis of acquired spectra was done in MATLAB equipped with PLS_Toolbox 9.0 (Eigenvector Research, Inc., Manson, WA).For PLS-DA, cross-validations from full calibration models were employed unless stated otherwise.Pre-processing of each ANOVA graph and PLS-DA model was done using mean centering and 1st-derivative smoothing (n = 2, fl = 15 pt.).The number of latent variables (LVs) (loadings plots can be found in Figs.S3-S6) per model were selected based on the most appropriate root mean square error (in cross-validation) value for each model.

Results and discussion
In the SERS spectra acquired from PBA-dyed hair before placement in soils (control), we detected vibrational bands centered at 366, 451, 494, 579, 734, 757, 827, 871, 950, 1003, 1043, 1135, 1210, 1235, 1316, 1433, 1512, and 1592 cm -1 , Fig. 1 and Table S3.As time progressed, we observed a drastic change between the relative intensities of 1512 and 1433 cm −1 band peaks, corresponding to C-C stretching likely from aromatic rings of the colorants, attributing to the degradative effects of soils on PBA over time.Because the most notable difference is in the relative intensities of peaks centered at 1512 cm −1 , we performed ANOVA of PBA spectra at this peak, Fig. 2.
While the initial pattern of degradation is not apparently linear and well-distinguished, when time exposed groups were combined based on relative intensity range similarities (within blue dashed lines, Fig. 2), a more unique and linear model is given.
In the SERS spectra acquired from PBU-dyed hair before placement in soils, we detected vibrational bands centered at 335, 438, 470, 540, 565, 653, 684, 713, 804, 864, 903, 1022, 1076, 1156, 1184, 1242, 1320, 1361, 1396, 1455, 1508, 1591, 1639, and 1668 cm -1 , Fig. 1 and Table S3.As time progressed, we see a strong change between the relative intensities of peaks at 1361 cm −1 , for example, which also corresponds to C-C stretching from the aromatic rings of the colorants.Because of this, we performed ANOVA of PBU spectra at this band and found an overall strong positive correlation between peak height and exposure time.We further sought to increase linearity and uniqueness of groups by grouping weeks with similar relative intensity ranges together.
Raman bands peaking either higher or lower than the control spectra, and exhibiting slight fluctuations over time, may indicate colorants with higher susceptibility to degradation.The analysis of such bands contributes to enhanced fluorescence, revealing the remaining colorant during examination (Table 1).
Without an obvious extrapolation of a post-burial interval from the ANOVA results, we considered the use of machine learning, which instead of analyzing the variance at one peak, a model can be built using all qualities of multiple peaks by week per colorant.Therefore, we utilized PLS-DA to determine the accuracy of SERS-based identification of exposure time for hair buried in varying soils for up to 8 weeks, Table 2.We found that control and week one samples (from all soils) allowed for the best differentiation by PLS-DA, with overall accuracies of 99.25 and 85.42%, respectively.PPU and PRD colorants contributed the overall greatest differentiations between weeks with 78.52 and 88.89%, respectively.However, the reasoning for this is likely due to the full resonance contribution of the AuNPs used as discussed earlier as well as the relatively low number of spectra acquired compared to other dye groups.
From the modified groups in the ANOVA analysis of PBA and PBU shown in Fig. 2, we wanted to explore if PLS-DA improved at identifying the time exposed, Table 3.We found that PLS-DA allowed for the elucidation of exposed time of PBA-dyed hair for control with 100% accuracy, combined one-to-four weeks buried with 92.33% accuracy, and combined five-to-eight weeks buried with 91.5% accuracy.What's more is PLS-DA allowed for the elucidation of exposed time of PBU-dyed hair for control with 100% accuracy, combined one-to-five weeks buried with 95.47% accuracy, and combined six-to-eight weeks buried with 95.11% accuracy.This indicates that while most exposure times could not be well extrapolated from most dyes for combined soils, specific dyes may allow for an accurate range of weeks determination.One may ask how drastic the individual effects are from each soil on detecting each colorant.For this we calibrated (trained) a PLS-DA model with the control colorants spectra and validated (tested) the model using spectra separated by soil type, Table 4.We found that STA had the least drastic effects on PBA, PBU, SBU, SPU, and SRD with TPRs of 100, 100, 99.75, 92, and 100%, respectively.STA's ability to extrapolate hair dyed with PPU, PRD, and SBA, was poor with TPRs of 20, 37.5, and 6.5%, respectively.Additionally, we found that STB had small effects on PBA, PBU, SBU, SPU, and SRD with TPRs of 99.25, 100, 100, 98, and 99.75%, respectively.The effects of STB on PPU, PRD, and SBA dyed hair were shown to be significant with TPRs of 7.5, 47.5, and 6%, respectively.Finally, dyed hair buried in STC still allowed for highly accurate identification of PBA, PBU, SBU, SPU, and SRD groups, with TPRs of 100, 100, 99.75, 92, and 100%, respectively.Just as with STA and STB, STC had apparently large effects on hair dyed with PPU, PRD, and SBA, given TPRs of 20, 37.5, and 6.5%, respectively.
While the TPR results of PPU and PRD should be of no surprise since the spectra beyond control are quite different than the control, PLS-DA of SBA-dyed hair buried in each soil type consistently generated poor results.Upon further analysis into the reason for which this was occurring, it was noted that most spectra from SBAdyed hair past control were predicted as belonging to PBA-dyed hair spectra.This indicates that the colorants between PBA and SBA are very similar, which is known 10 , and in order to build a more reliable PLS-DA model, more spectra from SBA, after different environmental effects such as in this experiment are applied, should be uploaded for model calibration to cover more grounds, which is demonstrated in the full calibration model from all soils for all colorants, Table 5.Using spectra from all degradative time points, PLS-DA was still able to differentiate all models (save PPU and PRD) with over 90% accuracy, including SBA which jumped from an overall 4.17% prediction accuracy to a 98.24% prediction accuracy in this model.These results indicate an overall high achievability of SERS to extrapolate colorant information of dyed hairs buried in varying soils. https://doi.org/10.1038/s41598-024-57147-2

Figure 1 .
Figure 1.Averaged SERS spectra from hair colored with (A) PBA, (B) PBU, (C) SBA, (D) SBU, (E) SPU, and (F) SRD, buried in combined soils (Soil types A-C) with corresponding dye signatures.Gold dashed lines represent bands likely promoted by AuNPs and blue dashed lines represent all other bands from the colorant.

Figure 2 .
Figure 2. Kruskal-Wallis ANOVA multiple comparison graphs of relative intensities in spectra acquired from hair dyed with (A) PBA at 1512 cm −1 and (B) PBU at 1361 cm −1 , including after combining class groups for (C) PBA and (D) PBU, respectively.The solid-colored bars represent 95% confidence intervals for each class (control and weeks), the black circles represent mean values, and the blue dashed lines represent classification thresholds, determined by relevant confidence intervals.

Figure 3 .
Figure 3. Kruskal-Wallis ANOVA multiple comparison graphs of relative intensities in spectra acquired from hair dyed with (A) SBA at 1314 cm −1 , (B) SBU at 582 cm −1 , (C) SPU at 1588 cm −1 , and (D) SRD at 1601 cm −1 .The solid-colored bars represent 95% confidence intervals for each class and the black circles represent mean values.

Table 4 .
Control-calibrated PLS-DA confusion matrices results for prediction models tested using spectra grouped by soil type.

Table 5 .
Full calibration PLS-DA confusion matrix for all colorants from all soils and controls.